TweetGeo - A Tool for Collecting, Processing and Analysing Geo-encoded Linguistic Data
نویسندگان
چکیده
In this paper we present a newly developed tool that enables researchers interested in spatial variation of language to define a geographic perimeter of interest, collect data from the Twitter streaming API published in that perimeter, filter the obtained data by language and country, define and extract variables of interest and analyse the extracted variables by one spatial statistic and two spatial visualisations. We showcase the tool on the area and a selection of languages spoken in former Yugoslavia. By defining the perimeter, languages and a series of linguistic variables of interest we demonstrate the data collection, processing and analysis capabilities of the tool.
منابع مشابه
Analysing Names of Organic Chemical Compounds -- From Morpho-Semantics to SMILES Strings and Classes (Web Version)
The linguistic analysis of chemical terminology is a key to biochemical text processing and semi-automatic database curation. The system described analyses systematic and semi-systematic names of chemical compounds, class terms, and also otherwise underspecified names by means of a morpho-semantic grammar developed according to IUPAC nomenclature. It yields an intermediate semantic representati...
متن کاملLexicalization vs. Vocalization: A Cross-Linguistic Study of Emphasis in English and Persian
Language is a system of verbal elements that makes communication of meaningspossible in the manners the users intend by employing certain linguistic deviceswhich are partly language-specific. Once communicating cross-linguistically, thereis always a risk of negative transfer of techniques or processes from the firstlanguage (L1) to the foreign language (L2). The current study investigates the“e...
متن کاملAnalysing Price, Quality and Lead Time Decisions with the Hybrid Solution Method of Fuzzy Logic and Genetic Algorithm
In this paper, the problem of determining the quality level, lead time for order delivery and price of a product produced by a manufacturer is considered. In this problem the demand for the product is influenced by all three decision variables: price, lead time and quality level. To formulate the demand function, a fuzzy rule base that estimates the demand value based on the three decision vari...
متن کاملWorldlikeness: A Web-based Tool for Typological Psycholinguistic Research
In this paper, we introduce Worldlikeness, a web-based tool for collecting and sharing cross-linguistic wordlikeness judgments (nonce word acceptability judgments) to facilitate typological psycholinguistic research. Typological psycholinguistic research is essential since crucial factors affecting language processing vary across languages, but these factors often too confounded to tease apart ...
متن کاملAnalysing Youth Inclination to Body Management and Organs Control
The human body is endowed with varied forms of social significance, and it is this which sociology has addressed by asking questions such as: Towhat degree do individuals have control over their own bodies? Using familiar examples from everyday life, such as diet and exercise regimes,personal hygiene, dress, displays of emotion, and control over bodilyfunctions. This research investigates the b...
متن کامل